Location: Limerick, Ireland | Hybrid
We are seeking an experienced Site Reliability Engineer (SRE) to join a growing Engineering team supporting a leading SaaS platform. You will ensure high availability, scalability, and performance of production, staging, and development environments, with a focus on automation, cloud operations, and production support.
Build, maintain, and monitor highly available cloud infrastructure (Linux/Windows).
Provide 24/7 production support and troubleshoot technical issues.
Collaborate with application, DBA, and cloud teams to deliver scalable solutions.
Implement automation and Infrastructure-as-Code (Terraform, Ansible, scripting).
Monitor and improve system performance using tools such as Prometheus, Grafana, or ELK.
Ensure security best practices and disaster recovery processes are followed.
3+ years of experience in SRE, DevOps, or Systems Administration roles.
Hands-on experience with AWS services (EC2, S3, Lambda, VPC, IAM, etc.).
Strong scripting/automation skills (PowerShell, Python, or similar).
Familiarity with containerization (Docker, Kubernetes, Helm).
Experience with multi-tier SaaS or microservices architectures.
Good understanding of networking, load balancing, and patch management.
Experience with CI/CD pipelines, Git, and Azure DevOps.
Experience with SQL Server administration.
Experience in regulated environments or cloud security governance.
AWS certification or equivalent.
Apply now if you are a hands-on, proactive engineer passionate about building reliable, scalable cloud solutions.
Reperio Human Capital acts as an Employment Agency and an Employment Business.